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Abstract. Let T be a weighted tree with n leaves numbered by the set {1, ...,n}. Let Di j be the 
distance between the leaves i and j. Let -Djj.fc = h (Aj + + A,fe)- We will call such numbers 
"triple weights" of the tree. 

In this paper, we find necessary and sufficient conditions for a set of real numbers indexed by 3-subsets 
of an n-set to be the set of the triple weights of a tree with n leaves. 

By using the same ideas, we give also a characterization, different from the previous ones, for sets indexed 
by 2-subsets of a n-set to be double weights of a tree. 

1 Introduction 

Consider a positive- weighted tree T (that is a tree such that every edge is endowed with a 
positive real number, which we call the length of the edge) with n leaves. Let Dij(T) be the 
sum of the lengths of the edges of the shortest path connecting % and j. We call such number 
"the distance" between the leaves i and j or "double weight" for i and j. 
In 1971 Buneman characterized the metrics on finite sets coming from a tree: 

Theorem 1 (Buneman) A metric (Dij) on {1, ...,n} is the metric induced by a positive- 
weighted tree if and only if for all i,j,k,h € {l,...,n} the maximum of {-Djj + Dfc^, -Dj^ + 
Djh^Dih + Dfcj} is attained at least twice. 

The problem of reconstructing trees from data involving the distances between the leaves has 
several applications, such as internet tomography and phylogenetics; evolution of species can be 
represented by trees and, given distances between genetic sequences of some species, one can try 
to reconstruct the evolution tree from these distances. 

Some algorithms to reconstruct trees from the data (Dij) have been proposed. Among them is 
neighbour-joining method, invented by Saitou and Nei in 1987 (see [NSj . [SK] and |PSt2j ). 
For any weighted tree T, let now 

Di,j,k( T ) = \( D i,j( T ) + D jA T ) + Di,k(T)), 

that is the sum of the lengths of the edges of the minimal subtree with i,j, k as set of leaves. We 
call such numbers "triple weights" of the tree. More generally define the fc-weights of the tree 
Di lt .... t i k (T) as the sum of the lengths of the edges of the minimal subtree connecting ii,....,if.. 



In 2004 Patcher and Speyer proved the following theorem. 

Theorem 2 (Patcher-Speyer). Let k, n E N with n > 2k — 1 and k > 3. A positive-weighted 
tree T with n leaves 1, ...,n and no vertices of degree 2 is determined by the values Dj where I 
varies in the k-subsets o/{l,...,n}. 

It can be interesting to characterize the sets of real numbers which are sets of fc-weights of a 
tree, for instance triple weights of a tree; in fact (I quote Speyer and Sturmfels's paper |SS2| ) "it 
can be more reliable statistically to estimate the triple weights -Djj,fc rather than the pairwise 
distances D^j" . We refer to [SS2] and above all to [PS| for an analysis of this and the references. 
In [BCj Bocci and Cools generalize Buneman's result for A;- weights. 

In this paper we find necessary and sufficient conditions for a set of real numbers indexed by 
3-subsets of an n-set to be the set of the triple weights of a tree with n leaves (see Theorem [TOl 
or HID. 

By using the same ideas we give also a characterization (different from Buneman's one) for sets 
indexed by 2-subsets of a n-set to be double weights of a tree (see Theorem fT2j) . 
Finally, by using the characterization of neighbours we used to deduce the above theorems, we 
propose a slight modification of neighbour-joining algorithm. 

2 

Definition 3 A 2-bell (or a "2-cherry") B in a tree T is a subtree B with two leaves such that 
only one of the inner vertices is not bivalent (that is, it is not a vertex of only two edges); we 
call this vertex "stalk" of the bell and we say that the twp leaves are neighbours. We call the 
path from a leaf of a bell to its stalk "twig " of this leaf. 
A bell is a union of 2-bells with the same stalk. 



B 




Notation 4 • For every n € N let [n] = {1, ....,n}. 

• For any set {Dr^ ^i} of real numbers indexed by the k-subsets ....,«&} of [n] we denote 
D {i u ...,i k } b V D h,-,ik f° r an V order ofh,....,ik- 

• A weighted tree is a tree such that every edge is endowed with a real number called weight o 
length of the edge. 
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Notation 5 For any set {-D^,....,^} of real numbers indexed by the k-subsets {ii, ■■■■,ik} of [n] 
and for any a, a' € [n], we define * a,a the following condition: 

Avn,--,7fc-i ~ D a' r/1 ,....r/ k -i doesn't depend on 71, ....,j k -i G [n] - {a, a'} 

Proposition 6 Let T be a positive-weighted tree with leaves l,....,n with n > 2k — 1. Let 
a, a' £ [n]. Then * a ' a holds for {-D^,...,^ (T)}^ v .. ) j fce [ n ] i/ and on/y i/a and a' /orm a 6eZZ. 

Proof. 

<= Obvious. 

=> We say that an inner vertex is a node if it is not bivalent. 

Let a, a' be such that * a,a holds. Let us suppose that a and a' are not neighbours. Then on 
the path from a to a' there are at least two nodes. Let x, y be two nodes on the path from a 
to a' such that there is no node in the path from x to y (suppose x nearer to a and y nearer to 
a'). 

For every 8 G [n], let S be the node on the path from a to a' such that 

path{a, a) n path(a, 5) = path(a, 8) 

We can divide [n] into two disjoint subsets: 
X = {8 € [n] I 8 is between a and x} 
Y = {8 G [n] I 8 is between y and a'} 

Since n > 2k — 1, one of the two subsets must contain at least k — 1 elements 71,..., Jk-i 
besides one among a or a'; we can suppose it is X; the other one, that is Y, must contain 
another element rj besides a' (since y is a node). Up to changing the names of 71, ..,jk-i ( an d 
correspondingly 71, 7fc_i), we can suppose 

length path(a,~/i) < length path(a, 72) < < length path(a,~fk~i) 




a 



Then we have 

A*,7i,....,7fc_i - Aj',7i,....,7 fc _i = length path(a,^j~T) - length path(a' < 
< length path(a,rj) - length path(a' ,7^) = Dtt,,,^,...,^^ -E>a!, v 
contradicting the assumption * a ' Q '. 



/,72,----,7fc-l 



□ 
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Remark 7 A tree with 5 leaves is homeomorphic to the following tree (with the nodes possibly 
collapsed): 




Proposition 8 Let {Di,j,k}i,j,ke[5] be a set of real numbers. There exists a weighted tree T such 
that Dij t k = Dij t k(T) if and only if there exist a, a', (3, (3', distinct in [5] such that * a ' a and 
*W hold. 

Proof. Let T be a weighted tree with 5 leaves. By the previous remark it is homeomorphic to 
the following tree and we call the leaves and the weights as in the figure: 



a 




1 



Obviously we have: 
D a ,a',j = fi+a + b + c 
D a ,a>,p = h + f2 + a + b + d 
D ar/ ,/3 = fi + f'2 + a + c + d 

D a',j,[3 = fl + f2 + b + C + d 

D at a>,i3> = fi + f2 + a + b + e 
D a ,j,/3' = fi + f2 + a + c + e 
Aj',7,/3' = /i + /2 + & + c + e 
D a ,p,P' = fi + f2 + a + d + e 
D a >,p,fl> = fi + f2 + b + d + e 
D~f,f3,p> = f 2 + c + d + e 
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Let 



S :-- 



One can easily see that 



/ /i + a + 6 + c \ 
fi + h+a + b + d » 
fl+h + a + c + d 
fi+ h + b + c + d 
fl + h + a + b + e 
h+f'2 + a + c + e 
h + h + b + c+e 
fi+f'2 + a + d + e 
i fi + f2 + b + d + e 
\ /2 + c + d + e / 



a,b,c,d,e,fi,f 2 € R 
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-Dq,7,/3 








D a',-f,0 
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D a ,a',/3' 
















D a' ,7,/3' 








D a,0,0' 
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D a',0,0' 






D f,0,0' 


J 



€ R 10 | and fco/d 
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Notation 9 For every set {-Dii,...,i fe }i 1 ,....,i fe e[n] o/ rea/ numbers we say that a = {a±, ....,a r } C 
[n] is a (r-)pseudobell if * ai > a j holds for all i,j. 

We say that a is a complete pseudobell if fl(3 G [n] — a such that * /3 ' Ql holds for all i. 

Now we are ready to state the theorem characterizing the sets of real numbers indexed by 3- 
subsets of [n] which come from a tree. Shortly speaking such a set comes from a tree if and only 
if in [n] there are at least two pseudobells and if we substitute every pseudobell with a point, 
the same condition holds for the new set and so on. 

Theorem 10 Let n > 5. Let {Dij^} i j f ; , € ^ be a set of real numbers. There exists a tree R 
such that Di j k = Dij^(R) for all i,j, k if and only if the following conditions hold: 
1 There exist at least two disjoint 2-pseudobells in [n] 1 . 

Define L as the set obtained from [n] by substituting, for any complete pseudobell a, all the 
elements of the pseudobell with a new leaf Zg_ and define for all a £ a 

&a • — ~^\Dot,a' ~\~ D a X y D a i : x,y) 

for any x,y E [n] — {a, a'}, a' € a and where D. t . is defined in 2 (a a will be the length of the 
twig of a in R) 

For any disjoint pseudobells a, (3 and 7 define 

D z^,j,k ■= D a,j,k ~a a Va G a 

This means that there exist at least distinct a, a '(3, f3' such that * a ' a and *' 3 '' 3 hold; we are not excluding 
that also * a 'P may hold, that is, that a,a'/3,f3' may be in the same pseudobell 
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D 



D 




D a,/3,j -a a -ap Va G a, (3 £ (3 
, — a a — ap — a 7 Va G a, £ (3, 7G7 



T/ien condition 1 holds for D%j,k with i,j, k £ L instead of [re] and so on up to a 5-sei ai the 
last step you would get a set with less than 5 elements, take off less pseudobells). 
2 The definition 



is a good definition, that is the second member doesn't depend on r,s,u G [n] 
Proof. 

=>■ Obvious. 

<= By induction on n. If re = 5, the statement follows from Remark 
Let us prove the induction step. 

First of all observe that, if the definition of D^j is a good definition for i,j G [re], then it is a 
good definition also for i,j G L. 

Besides one can easily prove that the definition of a a for a in a pseudobell a is a good definition, 
that is it doesn't depend either on a' or on x,y (because * a ' a holds). 

We have to prove also that, for any pseudobell a, the definition of D Za j,ki ls a good definition, 
in fact we have defined it as D a j,k ~ a a for any x, y and for some a G a, but we could define 
it as D a ij i. — a a i for another element a' G a. Obviously they are the same (choose x = j and 
y = k in the expressions of a a and a a /). 

Analogously we can prove that the definitions of Dz a ,zn,j and of D ZatZg>z ^ are good definitions. 
By induction assumption we can suppose that there exists a tree T such that 



for any i,j,k G L. We define R as the tree obtained from T by adding, for every pseudobell a of 
[re], a bell with leaves the elements of the pseudobell and stalk z a and a a as length of the twig 
of a for any a G a. 

We have to show that Dij k(R) = D^jk for any k G [re]. If k are elements also of L we 
already know that. 

So we have to show that D^ ^R) = Dij^ for any i,j, k G [re] with some of k not in L, that 
is with some of i,j, A: in a pseudobell of [re]. 
We can distinguish two main cases: 

A) the elements among i,j, k that are in some pseudobells of [re], are in disjoint pseudobells of 
[n] _ 

B) there are at least two among k in the same pseudobell of [re]. 

The proof of case A can be divided into three subcases: 

Al) D a j^(R) = D a j,k for any j, k G [re] D L and a in a pseudobell a in [n] 
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A2) D a> /3j(R) = Da^j for any j € [n] n L, a £ a, (3 £ (3 with a, (3 disjoint pseudobells in [n] 
A3) D at /3^(R) = D a ^ n for any a £ a, (3 £ (3, 7G7 with a, (3, 7 disjoint pseudobells in [n\. 
The proof of case B can be divided into: 

Bl) D a)a 'j(R) = D ata 'j for any j € [n] (1 L, a, a' in a pseudobell a in [n] 

B2) D ol!a / ! p(R) = D ata '^ for any a, a' E a, (3 & ft with a, /3 disjoint pseudobells in [n] 

B3) D a a i a ii{R) = D a a i a ii for any a, a', a" in a pseudobell a in [n]. 

a 1 A n rr>\ de f-°f R ,n ind - ass - i n ^ °^ Dz 2. J ' fc , n n 

A1 J ^a,j,k{ K ) — a « + 1J Za,j,k{ 1 ) — 0'a + J-'z^,j,k " + ^a,j,k ~ «a — ^aj,fc 

A2) £> Q ,/3,j (-R) = D z ^ Z pj (T) + a a + a[3 = D z ^ z ^ + a a + ap = D aj pj -a^-ap + a^ + a^ = D aM 
A3) Analogous. 

Bl) D ata , d (R) def = f R a a + a* + D Z ^(T) = 

VX '= eL a a +a al + l[D z ^ x {T)+D z ^ y {T)+D z ^^ 
D^, X , U (T) + D jiX , y (T) + Dj iVjU (T) + D J>)tt (T)] = 

= Oq, + a a l + g {D Z a_j, x D Z a,j,y + ^Za,j,U + Ac,J/,u) — g (D z ^ jX> y + D z ^ y u + D z ^ XjU + Dj xy + 

Dj : y : u + Dj yXyU ) = 

def. of D z ^,.,. 2 1 

= Oq, + Oq,' + ^{Da,j,x ~\~ D a j^y + Da,j,u ~\~ D x ^y tU ) 2a a — -^{Da,x,y + ^ a,j,ti + fa,i,u + 

Dj,x,y Dj^y^ u + Dj :X: u) ~\~ &a = 
defof_D.,. _ 

Vr,s€[n],_de/o/D.,. 2 m , n n n s _ l m , n , n n , 

— 3 y^ajja' ' J - / a,j,y t J-^aJjU > ^o! ,y,u) 3 K^a^a! ,y t ^a^y^u t - L/ a,a',u t LJ j,o! ,y > 

^j,y,u "t" I-^j,a',u) + 2"(^ a : Q ' ^ct',r,s ~ D a r s ) = 

= ^{D a j a i + Da,j,y + D a j jU + Da',y,u) ^{D a ,a' ,y + ^a.,y,u ~\~ D a ^ a /^ u + Dj^a',y + 

Dj t y : u ~\~ Dj a >^ + ^{Da,a' ,j ~l" D a a > y + D a ,a',u ^j,y,u) g (-^a j,u "i" ^a,j,y ~\~ ^a,y,« "i" D a i j u + 
■Da',j,y "I" Doc',y,u) ~\~ 2(^01' ,r,s ~ ^a,r,s) = 

= D aa /j 

B2) D ata , tP {R) def = f R a a + a a , + D Z ^_{T) + G/3 = 

v -^« eL aa + aQ , a/3 + f^^^T) + D z ^ y (T) + D Z ^ U (T) + Z? X)J/jU (T)] - |[D^ lX)K (r) + 
D z ^y tU (T) + D z ^ xu (T) + D Zf3!X: y(T) + D Z/3i y >u (T) + D Z/3jX)tl (T)] 



7 



a n + a a i + ap + ^(D a> p jX + D a ^ jy + D a ,/?,u + D x ,y,u) — 2a Q - 2ag 



— a a + a a ' + a /3 + 3 (-^za ,«£,a; + D z ^ z ^y + D Z ^ Z ^ )U + D X) y yU ) 3 (D z ^ XtV + D z ^ VyU + D Z ^ X)U + 
Dzp,x,y + D z ^ VjU + D z ^ jX>u ) = 

de/. of £>*„,.,. and of D Z ^ Z/S 

^{D a X y + D a ,y,u ~\~ D a x u + Df) t x,y ~l~ D{jy U + Df3,x,u) ~\~ 0-a ~\~ 0/3 

de/ o/ D.,. 

= ' A*„3 + V = 

Vr,s€[n], def of D.,. 2 1 t\ n 7-1 r> \ 1 / n 7-1 7-1 n 

= 3 (,-^a,/?,a' ~l~ -L/a,/3,y ~\~ J-'a,f5,u ~\~ J-)a',y,u) ~ 3"(.-^ a,a' ,y "i" ^a,y,u "i" ^a,a',u L)f3,a',y 

Vr,se[n],defofD.,. % \_l(D J- D -L D -L D 1 

3 ^ J --'a,/3,a' ~r J -^a,l3,y t ■L-'a t /3,u ~r ^a 1 ,y,u ) 3 y^a^a' ,y t ^a,y,u ~r J-'a,a l ,u ~r ■*~'/3,a',y t 

D/3 : y jU + Dp ta > jU ) + 3(-D a ,a',/3 +fa,a',y +f«,a',u + Df3,y,u) ~ l(D a ,/3,u + D a ,/3,y + D 'a.,y,u + ^a' ,/3,u + 

Da',f3,y ~l~ ^a',y,u) 2~{Da' ,r,s f«,r,s) = 

= A*,a',/3 

B3) D a}Ce / }Ce //(R) ^ = a a + a a / + a a » = 

= 2^^ a ^ a ' Da,x,y ~ D a i X y) + 2(^0', a" + ^a',x,y ~ ^a",x,y) + 2^^ a "> a D a ",x,y ~ ^a,x,y) = 

= \ {D a ^ a ' + £>q', q » + D a n a ) = 

= 3 (^a,a',«" ~i~ D a ^ a ' yS + Da,a',u ~\~ f a",s,«) g(-Da,o",s + D a a " u + D a ,s,u ~\~ ^a',a",s ~l~ 

D a ' ,a" ,u ^o! ,s,u)~^~ 

~t~3 {Da' ,a" ,a D a : a » s + D a ',a",u ~l~ D a s u ) g(-Da',a,s "i" D a i a u + D a ',s,u D a ii a s + D a " au 
D a ",s,u) + 

a,a" ,a' Da,a" ,s ~\~ D a ^ a ii u + Da',s,u) g(-Da,a',s + fa,a',u + f a,s,« "I - ^a",a',s + D a tt a i u + 
D a " ,s,u) 

□ 

Obviously condition 2 can be avoided by supposing that the come from Djj, that is the 

theorem can be stated in the following way: 

Theorem 11 Lei n > 5. Zei {-Di,j}i,jg[n] ^ e a se ^ °/ rea ^ numbers. Let D ^ := + f + 

There exists a tree R such that Dij^ = D i jj t (R) for all i,j,k if and only if the following 
conditions hold: 

- There exist at least two disjoint 2-pseudobells in [n\. 



Define L as the set obtained from [n] by substituting, for any complete pseudobell a, all the 
elements of the pseudobell with a new leaf z a and define for all a G a 

O'a • — ~^\D a a i -\- Da,x,y D a i X y) 

for any x,y G [n] — {a, a'}, a' G a. 

For any disjoint pseudobells a, (3 and 7 define 

D Z a,j,k ■= - a a Va G a 

D z^ zp ,j ■= D a> p t j - a a - a/3 Va G a, (3 G §_ 

Dz^zp^ ■= D a,Pa ~ a a ~ ap - a 7 Va G a, G 0, 7 G 7 

-Then the same condition holds for D^j^ with i,j,k G L instead of [n] and so on up to a 5-set 
(if at the last step you get a set with less than 5 elements, take off less pseudobells). 

Analogously we can prove: 

Theorem 12 Let n > 4. Let {Dij}ij£\ n ] be a set of real numbers. 

There exists a tree R such that Dij = Dij(R) for all i,j if and only if the following conditions 
hold: 

- There exist at least two disjoint 2-pseudobells in \n\. 

Define L as the set obtained from [n] by substituting, for any complete pseudobell a, all the 
elements of the pseudobell with a new leaf Zg_ and define for all a £ a 

O-a • — ~Q\Da,a. 1 Da,x Da' ,x) 

for any x G [n] — {a, a'}, a' £ a 

For any disjoint pseudobells a, (3, define 

D z ^j := D a j - a a Va G a 

Dz^zp '■= D a,p ~ a a - ap Va G a, (3 G §_ 

-Then the same condition holds for Dijk with i,j,k G L instead of [n] and so on up to a J^-set 
(if at the last step you get a set with less than 4 elements, take off less pseudobells). 

Remark 13 Proposition^ assures us that if we have a positive-weighted tree E with no vertices 
of degree 2 and we consider the set {Dij t k(E)} (or {Dij(E)}) then the tree R coming from the 
above theorems is equal to E (or isomorphic to E if we allow vertices of degree 2 in E). 

Remark 14 Obviously R in Theorem [73 is positive-weighted if and only if all a a are positive 
for a G [n] and also for a G L and so on up to a set of 3- elements (that is R is positive-weighted 
if and only if a a > for all a in the sets we get by "pruning" pseudobells until we get a set of 
3 elements). 

Analogously R in Theorems [77J or [TJJ is positive-weighted if and only if a a > for all a in the 
sets we get by "pruning" pseudobells until we get a set S of 5 elements and also for every a in 
all the sets of 4 elements we get by pruning two neighbours of S. 
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We describe shortly Nei-Saitou's neighbours-joining algorithm (N-J algorithm for short) to re- 
construct trees from the data (-Dij)ije[nl (see |NS| . |SKj and [PSt2j ). 
Given (Aj)i,je[n], let 

Si,j := (n - 2) Dij - Y D i,k ~ Y D o,k 

k^i k^j 

Lemma 15 // i and j are neighbours, then Sij is the minimal element in its row and column 
of the matrix S; namely if i and j are neighbours, then 5^ — Sij > for all k and it is > iff 
k is not a neighbour of i and j. 

Theorem 16 If i and j are such that Sij is a minimum in the matrix S, then i and j are 
neighbours. 

The algorithm works in the following way: from {-A,j} calculate the matrix S; let i,j be a 
couple such that Si j is a minimum in the matrix S; by the theorem, we know that, if the data 
come from a tree, then i and j are neighbours; one can calculate the lengths of their twigs 
by the formula |(A,j + Di jX — Dj iX ) for any leaf x and then consider, instead of [n], the set 
L = ([ n ] - {i,j}) U {z}. Define D Z)V = D ijV - ±(A,j + D i>y - Dj )V ) = |(-Aj + D i>y + D j>y ) for 
any y and iterate the process. 

Observe that to calculate S we need 0(n 2 ) elementary operations and to find the minimum of 
the coefficients of S, we need also other 0(n 2 ) elementary operations. So, to find ONE bell 
we need 0(n 2 ) elementary operations. (Then we have to repeat the operations on L, whose 
cardinality is n — 1. So in total we need 0{n 2 + (n — l) 2 + ) = 0(n 3 ) elementary operations.) 

We can modify the N-J method in the following way. 
Given A,j> calculate the matrix (Sij). 

for every column S^ of S calculate the minimum m,-; this minimum will be attained perhaps 
more than one time. Choose the first index i such that Sij attains rrij, precisely let ij = 
min{i\ Sij = mj}. For these n couples of indices (ij,j) one calculate the difference between the 
ij-th column and the j-th column of the matrix D. 

By Proposition El by checking if this difference has all coefficients equal, we find out if ij and j 
are neighbours. 

In this way one can easily see that, for every cherry, we find out all its leaves (let i\, if. € [n] 
be the leaves of a cherry; suppose for instance that i\ < .... < ik', we will get for the column i\ 
the couple (^2,^1), for the column i2 the couple (ii,i2), for the column 13 the couple for 
the column i± the couple (i\,i^)..., in fact if i and j are neighbours, then Si f. — Sij > iff" is 
not a neighbour of i and j). 

So, by 0(n 2 ) elementary operations in all, we have found out ALL the bells of the tree. 
Once found all the bells, one can calculate the lengths of all the twigs as before, for instance if % 
and j are neighbours the length of the twig of i will be |(A,j + D^ x — Dj ;X ) for any leaf x and 
then substitute, for every bell, to all the element of the bell a new point and iterate the process. 
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Obviously we can allow some "noise", by searching, instead of the couples such that the 
difference of i-th column and the j-th column of the matrix D has all coefficients equal, the 
couples such that the coefficients of the difference of i-th column and the j-th column of 
the matrix D differ less than some e. 

Obviously this "pruning" version of N- J algorithm can be a bit quicker than the original one only 
in the cases where at every step there are a lot of bells; besides it could be useful when we can't 
calculate all the tree but we want to know at least the list of all the bells. Finally we observe 
that perhaps an underestimate of Dij for some i,j can be misleading in the reconstuction of the 
tree by classical N-J algorithm and that the characterization of neighbours given by Proposition 
[6] and the above described version of N-J algorithm can detect such misleading. 

In [LYPj . |PStl] and [PSt2] a generalization of N-J algorithm to get trees from /c-weights is 
described; namely let 

*U = £ri £ D(i,j,R)- J2 D(i,R)- £ D(j,R) 

R (k — 2) — subset R (k — 1) — subset R (fc — 1) — subset 

°f [»] - {m} of [n] - {i} of [n] - {j} 

If i,j are such that Sij is the minimum in the matrix S then i and j are neighbours (to calculate 
S and then to find out a bell, we need 0(n k ) elementary operations) and we can proceed as 
before. 

Again by using Proposition [6] we can modify the algorithm finding all the bells by 0(n k ) ele- 
mentary operations. 
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